FAN , AILIN . New Statistical Methods for Precision Medicine : Variable Selection for Optimal
نویسندگان
چکیده
FAN, AILIN. New Statistical Methods for Precision Medicine: Variable Selection for Optimal Dynamic Treatment Regimes and Subgroup Detection. (Under the direction of Dr. Wenbin Lu and Dr. Rui Song.) Due to patients’ heterogeneity and a growing number of specifically targeted treatments, precision medicine draws attentions for customization of therapies and medical decisions for individual patient. In this dissertation, we investigate statistical methods to address two problems in precision medicine. The first problem is variable selection for optimal dynamic treatment regime. Variable selection is gaining more attention because it plays an important role in deriving practical and reliable optimal treatment regimes, especially when there are a large number of predictors. The second problem is subgroup detection of patients with enhanced treatment effects. By assessing heterogeneous treatment effects based on a variety of covariates, subgroup detection helps narrow down the target population of a treatment. In Chapter 2, we develop a sequential advantage selection method for variable selection for optimal treatment regime. Variables that have qualitative interactions with treatment are of clinical importance for treatment decision-making. A qualitative interaction of a variable with treatment arises when the treatment effect changes direction as the value of the variable varies. Our sequential advantage selection method sequentially selects variables with a qualitative interaction and can be applied in multiple-decision-point settings. Numerical studies suggest that the proposed method is useful in identifying important variables under various underlying true models. In Chapter 3, we propose a penalized A-learning method for deriving the optimal dynamic treatment regime when the number of covariates is of the non-polynomial order of the sample size. To preserve the double robustness property of the A-learning method, we adopt the Dantzig selector which directly penalizes the A-learning estimating equations. Simulation studies show that the proposed method achieves good performance in terms of both variable selection and estimated optimal treatment regimes. In Chapter 4, we propose a systematic method for subgroup detection. Our method first tests the existence of a subgroup, and then identifies the subgroup if the null hypothesis on nonexistence of such a subgroup is rejected. A semiparametric model for response is considered for model flexibility, and a doubly-robust test statistic is constructed based on this model. Moreover, a sample size calculation method for subgroup detection is developed based on the proposed statistic. Simulation studies are provided to illustrate the empirical performance of the proposed methods for subgroup detection and sample size calculation.
منابع مشابه
An Overview of the New Feature Selection Methods in Finite Mixture of Regression Models
Variable (feature) selection has attracted much attention in contemporary statistical learning and recent scientific research. This is mainly due to the rapid advancement in modern technology that allows scientists to collect data of unprecedented size and complexity. One type of statistical problem in such applications is concerned with modeling an output variable as a function of a sma...
متن کاملProbing for Sparse and Fast Variable Selection with Model-Based Boosting
We present a new variable selection method based on model-based gradient boosting and randomly permuted variables. Model-based boosting is a tool to fit a statistical model while performing variable selection at the same time. A drawback of the fitting lies in the need of multiple model fits on slightly altered data (e.g., cross-validation or bootstrap) to find the optimal number of boosting it...
متن کاملVariable selection for optimal treatment decision.
In decision-making on optimal treatment strategies, it is of great importance to identify variables that are involved in the decision rule, i.e. those interacting with the treatment. Effective variable selection helps to improve the prediction accuracy and enhance the interpretability of the decision rule. We propose a new penalized regression framework which can simultaneously estimate the opt...
متن کاملA Selective Overview of Variable Selection in High Dimensional Feature Space.
High dimensional statistical problems arise from diverse fields of scientific research and technological development. Variable selection plays a pivotal role in contemporary statistical learning and scientific discoveries. The traditional idea of best subset selection methods, which can be regarded as a specific form of penalized likelihood, is computationally too expensive for many modern stat...
متن کاملStatistical analysis of big data on pharmacogenomics.
This paper discusses statistical methods for estimating complex correlation structure from large pharmacogenomic datasets. We selectively review several prominent statistical methods for estimating large covariance matrix for understanding correlation structure, inverse covariance matrix for network modeling, large-scale simultaneous tests for selecting significantly differently expressed genes...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016